Maintaining Spatial Data Sets in Distributed-Memory Machines
نویسندگان
چکیده
We propose a distributed data structure for maintaining spatial data sets on message-passing, distributed memory machines. The data structure is based on orthogonal bisection trees and it captures relevant characteristics of parallel machines. The operations we consider include insertion, deletion, and range queries. We introduce parameters to control how much imbalance is tolerated at each processor and to specify the load to be achieved during balancing. When balancing, we first broadcast point counts of a datadependent partition of the data. Based on this partition, we propose load balancing methods with different communication and computation requirements. We present initial experimental results for the Cray T3D.
منابع مشابه
Entropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملDistributed and Memory Efficient Machine Learning for Spatial Analysis Applications
In the context of spatial analysis, spatial disaggregation or spatial downscaling are processes by which information at a coarse spatial scale is translated to finer scales, while maintaining consistency with the original dataset. Fine grained descriptions of geographical information is a key resource in fields such as social-economic studies, urban and regional planning, transport planning, or...
متن کاملCompiler Techniques for Determining Data Distribution and Generating Communication Sets on Distributed-Memory Machines
This paper is concerned with designing eficient algorithms for determining data distribution and generating communication sets on distributed memory multicomputers. First, we propose a dynamic programming algorithm to automaliically determine data distribution at compiling time. The proposed algorithm also can determine whether da-La redistribution is necessary between two consecutive DO-loop p...
متن کاملMunin: Distributed Shared Memory Based on Type{speciic Memory Coherence
We are developing Munin y , a system that allows programs written for shared memory multiprocessors to be executed eeciently on distributed memory machines. Thus, Munin overcomes the architectural limitations of shared memory machines, while maintaining their advantages in terms of ease of programming. A unique characteristic of Munin is the mechanism by which the shared memory programming mode...
متن کاملCompiling Array Statements for E cient Execution onDistributed - Memory Machines : Two - level
In languages such as High Performance Fortran (HPF), array statements are used for expressing data parallelism. In compiling array statements for distributed-memory machines, eecient enumeration of local index sets and communication sets is important. The virtual processor approach, among several other methods, has been proposed for eecient enumeration of these index sets. In this paper, using ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997